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Abstract — The distributed multiple-input-multiple-output 
(MIMO) techniques across multiple cooperative base- 
stations (BS) can significantly enhance the capability of 
the broadband wireless networks in terms of quality-of-service 
(QoS) provisioning for wireless data transmissions. However, 
the computational complexity and the interfering range of the 
distributed MIMO systems also increase rapidly as the number 
of cooperative BS's increases. In this paper, we propose the 
QoS-aware BS-selection schemes for the distributed wireless 
MIMO links, which aim at minimizing the BS usages and 
reducing the interfering range, while satisfying diverse statistical 
delay-QoS constraints characterized by the delay-bound 
violation probability and the effective capacity technique. In 
particular, based on the channel state information (CSI) and 
QoS requirements, a subset of BS with variable cardinality for 
the distributed MIMO transmission is dynamically selected, 
where the selections are controlled by a central server. For the 
single-user scenario, we develop two optimization frameworks, 
respectively, to derive the efficient BS-selection schemes and the 
corresponding resource allocation algorithms. One framework 
uses the incremental BS-selection and time-sharing (IBS-TS) 
strategies, and the other employs the ordered-gain based 
BS-selection and probabilistic transmissions (OGBS-PT). The 
IBS-TS framework can yield better performance, while the 
scheme developed under the OGBS-PT framework is easier 
to implement. For the multi-user scenario, we propose the 
optimization framework applying the priority BS-selection, 
block-diagonalization precoding, and probabilistic transmission 
(PBS-BD-PT) techniques. We also propose the optimization 
framework applying the priority BS-selection, time-division- 
multiple-access, and probabilistic transmission (PBS-TDMA-PT) 
techniques. We derive the optimal transmission schemes for all 
the aforementioned frameworks, respectively. Also conducted is 
a set of simulation evaluations which compare our proposed 
schemes with several baseline schemes and show the impact of 
the delay-QoS requirements, transmit power, and traffic loads 
on the performances of BS selections for distributed MIMO 
systems. 

Index Terms — Distributed MIMO, broadband wireless net- 
works, statistical QoS provisioning, wireless fading channels. 

I. Introduction 

IN ORDER to increase the coverage of broadband wire- 
less networks, distributed multiple-input-multiple-output 
(MIMO) techniques, where multiple location-independent base 
stations (BS) cooperatively transmit data to mobile users, 
have attracted more and more research attentions [l]-[4]. In 
particular, the distributed MIMO techniques can effectively 
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organize multiple location-independent BS's to form the dis- 
tributed MIMO links connecting with mobile users, while 
not requiring too many multi-antennas, which are expensive, 
equipped at individual BS's. Like the conventional centralized 
MIMO system [5]-[7], the distributed MIMO system can 
significantly enhance the capability of the broadband wireless 
networks in terms of the quality-of-service (QoS) provisioning 
for wireless transmissions as compared to the single antenna 
system. However, the distributed nature for cooperative multi- 
BS transmissions also imposes many new challenges in wide- 
band wireless communications, which are not encountered in 
the centralized MIMO systems. 

First, the cooperative distributed transmissions cause the 
severe difficulty for synchronization among multiple location- 
independent BS transmitters. Second, as the number of cooper- 
ative BS's increases, the computational complexity for MIMO 
signal processing and coding also grow rapidly. Third, because 
the coordinated BS's are located at different geographical 
positions, the cooperative communications in fact enlarge 
the interfering areas for the used spectrum, thus drastically 
degrading the frequency-reuse efficiency in the spatial domain. 
Finally, many wide-band transmissions are sensitive to the 
delay, and thus we need to design QoS-aware distributed 
MIMO techniques, such that the scarce wireless resources can 
be more efficiently utilized. 

Towards the above issues, many research works on dis- 
tributed MIMO transmissions have been proposed recently. 
The feasibility of transmit beamforming with efficient syn- 
chronization techniques over distributed MIMO link has been 
demonstrated through experimental tests and theoretical analy- 
ses [2], [3], suggesting that complicated MIMO signal process- 
ing techniques are promising to implement in realistic systems. 
While the antenna selection [6], [7] is an effective approach to 
reduce the complexity for centralized MIMO systems, which 
can be also extended to distributed MIMO systems for the 
BS selection. It is clear that the BS-selection techniques can 
significantly decrease the processing complexity, while still 
achieving high throughput gain over the single BS transmis- 
sion. Also, it is desirable to minimize the number of selected 
BS's through BS-selection techniques, which can effectively 
decrease the interfering range and thus improve the frequency- 
reuse efficiency of the entire wireless network. Most previous 
research works for BS/antenna selections mainly focused on 
the scenarios of selecting a subset of BS's/antennas with 
the fixed cardinality [4], [6], [7]. However, it is evident that 
based on the wireless-channel status, BS-subset selections with 
dynamically adjusted cardinality can further decrease the BS 
usage. More importantly, how to efficiently support diverse 
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delay-QoS requirements through BS-selection in distributed 
MIMO systems still remains as a widely cited open problem. 

To overcome the aforementioned problems, we propose the 
QoS-aware BS-selection schemes for the distributed wireless 
MIMO links, which aim at minimizing the BS usages and 
reducing the interfering range, while satisfying diverse statis- 
tical delay-QoS constraints. In particular, based on the channel 
state information (CSI) and QoS requirements, the subset 
of BS with variable cardinality for the distributed MIMO 
transmission is dynamically selected, where the selections are 
controlled by a central server. For the single-user scenario, 
we consider the optimization framework which uses the in- 
cremental BS-selection and time-sharing (IBS-TS) strategies, 
and study another framework which employs the ordered-gain 
based BS-selection and probabilistic-transmissions (OGBS- 
PT) techniques. For the multi-user scenario, we propose the 
optimization framework applying the priority BS-selection, 
block-diagonalization multiple-access, and probabilistic trans- 
mission (PBS-BD-PT) techniques. We also propose the opti- 
mization framework applying the priority BS-selection, time- 
division-multiple-access, and probabilistic transmission (PBS- 
TDMA-PT) techniques. We derive the optimal transmission 
schemes for the above frameworks, respectively, and conduct 
comparative analyses with the baseline schemes through sim- 
ulations. 

The rest of this paper is organized as follows. Section II 
describes the system model for distributed MIMO transmis- 
sions. Section III introduces the statistical QoS guarantees 
and the concept of effective capacity. Section IV proposes 
the optimization framework for QoS-aware BS sections of 
the single-user case and develops its corresponding optimal 
solution. Section V develops the optimization framework for 
multi-user case and derives its optimal solution. Section VI 
simulates our proposed schemes. The paper concludes with 
Section VII. 

II. System Model 
A. System Architecture 

We concentrate on the wireless distributed MIMO system 
for downlink transmissions depicted in Fig. 1, which consists 
of K\y S distributed BS's, K mu mobile users, and one central 
server. The mth BS has M m transmit antennas for m = 
1,2,..., K\, s and the nth mobile user has N n receive antennas 
for n — 1, 2, ... , K mu . All distributed BS's are connected to 
the central server through high-speed optical connections. The 
data to be delivered to the nth mobile user, n = 1,2,..., K mu , 
arrives at the central server with a constant rate, which is 
denoted by C n . Then, the central server dynamically controls 
these distributed BS's to cooperatively transmit data to the 
mobile users under the specified delay-QoS requirements. 

For the case of K mu = 1, the distributed BS's and 
the mobile user form a single wireless MIMO link; when 
K mu > 2, the distributed BS's and the mobile users form 
the broadcast MIMO link for data transmissions. The wireless 
fading channels between the mth BS and the nth mobile user 
is modeled by an N n x M m matrix H, vm . The element at the 
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Fig. 1 . System model of a wireless distributed MIMO system for downlink 
transmissions. 
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Fig. 2. Our proposed QoS-aware BS-selection framework. 



complex channel gain between the ith receive antenna of ?ith 
mobile user and the jth transmit antenna of the mth BS. All 
elements of H n m are independent and circularly symmetric 
complex Gaussian random variables with zero mean and the 
variance equal to h n ^ m . Also, the instantaneous aggregate 
power gain of the MIMO link between the nth mobile user 
and the mth BS, denoted by j n ,m, is defined by 



N n M„ 
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'•J 



= 1 j = l 
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We define H„ = [H„.i H„,2 ■ • ■ H„.x bs ] as the CSI for 
the nth mobile user for n = 1, 2, . . . , K mu . The matrix 
H„ follows the independent block-fading model, where H„ 
does not change within a time period with the fixed length 
T, called a time frame, but varies independently from one 
frame to the other frame. Furthermore, we define H = 
[H[ • • • H^ b ] T , representing a fading state of the entire 
distributed MIMO system, where the superscript r denotes the 
transpose operation on a matrix or a vector. 

Under the aforementioned model, our proposed BS- 
selection framework is illustrated in Fig. 2. As shown in 
Fig. 2, based on CSI feedback from the users, the central 
data server will dynamically select a subset of BS's and then 
use the transmit antennas of all selected BS's to construct the 
distributed channels to transmit data to the K mu users. Our BS- 



ith row and jth column of H, M „, denoted by (H njm ) t ., is the selection strategies and the corresponding resource allocation 
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algorithms depends not only on the CSI, but also the statistical 
delay-QoS constraints (to be detailed on Section II-B and 
Section III) for the incoming traffics. Although the BS/antenna 
selection techniques for distributed/centralized MIMO systems 
have been extensively studied, most existing works [4], [6], 
[7] focused on maximizing the capacity or minimizing the 
error rate given the specified BS/antenna subset cardinality. In 
contrast, our work in this paper aims at tackling the following 
new challenges for distributed MIMO system. 1) Our BS- 
selection and the associated resource allocation algorithms 
need to guarantee the specified delay-QoS requirements for the 
incoming traffics. 2) The cardinality of selected BS-subset vary 
with CSI and delay QoS, which is more flexible and efficient 
than the selection schemes with the fixed subset cardinality. 
3) We aim at using the minimum average number of BS's to 
support the incoming traffic with delay-QoS guarantees, which 
also decreases the interferences to the entire network. 

B. The Delay QoS Requirements 

The central data server maintains a queue for the incoming 
traffic to each mobile user. We mainly focus on the queueing 
delay in this paper because the wireless channel is the major 
bottleneck for high-rate wireless transmissions. Since it is 
usually unrealistic to guarantee the hard delay bound over the 
highly time-varying wireless channels, we employ the statis- 
tical metric, namely, the delay-bound violation probability, to 
characterize the diverse delay QoS requirements. Specifically, 
for the nth mobile user, the probability of violating a spec- 
ified delay bound, denoted by cannot exceed a given 
threshold £„. That is, the inequality 

Pr [D n > D$ }<U, n = 1, 2, . . . , N mu , (2) 

needs to hold, where D n denotes the queueing delay in the 
nth mobile user's queueing system. 

C. Performance Metrics and Design Objective 

We denote by L the cardinality of the selected BS subset 
(the number of selected BS's) for the distributed MIMO 
transmission in a fading state. Then, we denote the expectation 
of L by L and call it the average BS usage. As mentioned in 
Section II-A, our major objective is to minimize L through 
dynamic BS selection while guaranteeing the delay QoS 
constraint specified by Eq. (2). Besides the average BS usage, 
we also need to evaluate the average interfering range affected 
by the distributed MIMO transmission. The instantaneous 
interfering range, denoted by A, is defined as the area of the 
region where the average received power under the current 
MIMO transmission is larger than a certain threshold denoted 
by of h . The average interfering area is then defined as the 
expectation E{A} over all fading states. Clearly, minimizing 
L can not only reduce implementation complexity, but also 
decrease the average interfering range affected by the transmit 
power. 

D. The Power Control Strategy 

The transmit power of our distributed MIMO system varies 
with the number of selected BS's. In particular, given the 



number L of selected BS's, the total instantaneous transmitted 
power used for distributed MIMO transmissions is set as a 
constant equal to Vl- Furthermore, Vl linearly increases with 
L by using the strategy as follows: 

V L =Vrel + K{L-l), L=l,2,...,K hs , (3) 

where V le i > is called the reference power and n > 
describes the power increasing rate against L. Also, we define 
Vl = for L = 0. The above power adaptation strategy 
is simple to implement, while the average transmit power 
can be effectively decreased through minimizing the average 
number of used BS's. In addition, Eq. (3) can upper-bound 
the instantaneous interferences and the interfering range over 
the entire network. 

III. Effective Capacity Approach for Statistical 
Delay-QoS Guarantees 

In this paper, we apply the theory of statistical QoS Guar- 
antees [8], [9], [13], [20] to integrate the constraint on delay- 
bound violation probability given by Eq. (2) into our BS selec- 
tion design. Consider a stable dynamic discrete-time queueing 
system. The data arrival-rate and data service-rate of the 
queueing system are denoted by C[k] and R[k], respectively, 
where k is the index for the time frame with a fixed time- 
duration equal to T (as also described in Section II-A) and 
the units of C[k] and R[k] are nats/frame. The C[k] and R[k] 
change from frame to frame and thus can be characterized as 
the time-varying processes. By using the asymptotic analyses 
based on the large deviation principal, the author of [8] showed 
that under the sufficient conditions, the queue-length process 
as a function of t, denoted by Q[k], converges in distribution 
to a random variable Q satisfying the following equation: 

- lim l0g ( Pr ^ > Q ^ = 9, (4) 

for a certain 9 > 0. A set of sufficient conditions given in [8] 
for Eq. (4) are summarized as follows. 1) Both C[k) and 
R[k) are stationary. 2) The Gartner-Ellis limits [8] for C[k] 
and lZ[k], denoted by Ac(0) and Ar(6), respectively, exist 
for all 0, where C[k] = £ t fe =1 C[t] and K[k] = £t=i#M- 
3) The processes C[k] and R[k] are independent. 4) There 
exists a certain 9 > such that Ac{6) = A.r(—6). For the 
details of the sufficient conditions for Eq. (4), please refer 
to [8]. It has been shown that Eq. (4) holds for many stable 
queueing systems with typical arrival/departure processes [8], 
[13], such as Markovian processes, auto-regressive processes, 
and capacity-achieving service processes over i.i.d. block- 
fading wireless channels. 

Based on Eq. (4), the probability that the queue-length, 
denoted by Q, exceeding a given bound Q t h can be approxi- 
mated [8] by 

Pr{Q>Qth}«e- 8Q -, (5) 

where 9 > is a constant called QoS exponent. It is clear 
that the larger (smaller) 9 implies the lower (higher) queue- 
length-bound violation probability. Furthermore, when the 
delay bound becomes the main QoS metric of interests, the 
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delay-bound violation probability can be approximated [9], 
[20] by 

Pi{D> Ah} ^e- 0v ^ Dtb , (6) 

where D and Ah denote the queueing delay and delay bound, 
respectively, and ip(6) is known as the effective bandwidth [8] 
of the arrival-rate process under the given 9. When the arrival 
rate C[k] is equal to a constant C over all k and the departure 
rate R[k] is time-varying, Eq. (6) can be written [9], [20]-[22] 
as 



Pi{D > Ah} « e 



-0CD th 



(7) 



Then, to upper-bound Pv{D > Ah} with a threshold £, using 
Eq. (7), we get the minimum required QoS exponent 9 as 
follows: 

a lQ g(0 



CD 



(8) 



tii 



Consider a discrete-time arrival process with constant rate 
C and the discrete-time time-varying departure process R[k], 
where k is the time index. In order to guarantee the desired 
9 determined by Eq. (8), the statistical QoS theory [8]— [10] 
shows that the effective capacity C(9) of the service-rate 
process R[k] needs to satisfy 



C{9) > C, 



(9) 



under the given QoS exponent 9. The effective capacity 
function is defined in [9] as the maximum constant arrival rate 
which can be supported by the service rate to guarantee the 
specified QoS exponent 9. If the service-rate sequence R[k] is 
stationary and time uncorrected, the effective capacity can be 
written [13] as 



C(0)4-J]og(E{e- 9 *ra}) 



(10) 



where E{-} denotes the expectation. 

In our distributed MIMO system, the BS selection result 
is designed as the function determined by the current CSI. 
Thus, the corresponding transmission rate (service rate) is 
time independent under the independent block-fading model 
(see Section II-A). Then, applying Eqs. (8)-(9), the delay QoS 
constraints given by Eq. (2) can be equivalently converted to: 



E H {e" "*" - e- e " c " } < 0, , n = 1, 2, . . . , JV n 



(11) 



where 9 n = - log(£ n )/(C„ A^ 
expectation over all H. 



and Eh{-} denotes the 



IV. QoS -Aware BS Selection for the Single-User 

Case 

We focus on the scenario with a single mobile user in 
this section, where K mu = 1. For presentation convenience, 
we use the term transmission mode L to denote the case 
where the cardinality of the selected BS subset is equal to 
L. Given transmission mode L, we denote by Q,l the set of 
indices of selected BS's, where fix, = il,2, ■ ■ ■ , 

and ili £ {1, 2, . . . , -K"bs} for ^ = 1,2, ...,L. Once a 
BS is selected, we use all its transmit antennas for data 



transmissions. Then, we characterize the channel matrix for 

A 



the selected BS subset by Hn t and write Hq l as Hq l 
[Hi t i LA Hi,i L>2 • ■ ■ Hi jil t ], which is an Ni x Ml matrix 
with Ml — J2i=i^iL,i- Accordingly, the physical-layer 
signal transmission is characterized by 



where y is the N\ x 1 received signal vector and ^ de- 
notes the Ni x 1 additive Gaussian noise vector whose 

elements are independent with unit power. The variable sq l = 

i T 

s ih i ' s iL 2 ' • • • ' s ih l * s tne input signal vector (transmitted 
signal vector) for the MIMO channel Hq l , where Si L t is the 
M iL e x 1 signal vector transmitted from the {M ih e )-Xh BS. 

Clearly, for dynamic BS selections of distributed MIMO 
transmissions, we need to answer the following three ques- 
tions: (i) For a specified transmission mode L, how do we de- 
termine the BS subset Q^,? (ii) How are the wireless resources 
shared if applying multiple modes within a time frame? (iii) 
Which transmission modes will be used and how to quantita- 
tively allocate the wireless resources? We first study associated 
issues for question (i) in Section IV-A. Then, we introduce 
the time-sharing transmission and probabilistic transmission 
to share the resources across different modes in Section IV-B. 
Following the discussions in Sections IV-A and IV-B, we 
formulate two analytical optimization frameworks to answer 
question (iii). In particular, the first optimization framework is 
based on the incremental BS-selection algorithm and the time- 
sharing strategy; the second framework applies the ordered - 
gain based BS selection algorithm with probabilistic trans- 
mission, which will be detailed in Sections IV-C and IV-D, 
respectively. The BS-selection scheme derived under the first 
optimization framework can achieve better performance, while 
the second optimization framework is simpler to implement. 



A. BS-Selection Strategy Given the Cardinality of the BS 
Subset 

In this paper, we focus on the spatial multiplexing based 
MIMO transmissions. Given Ql in a fading state, the max- 
imum achievable data rate (Shannon capacity), denoted by 
R(£Il) (nats/frame), is determined [5] by 

^1)=^ {sriog[dct(l + H S2i S4 L )] } (12) 

for L = 1,2,..., Kb s under given H, where (-)t represents 
the conjugate transpose, det(-) generates the determinant of 
a matrix, Tr(-) evaluates the trace of a matrix, and S is the 
covariance matrix of sq l . Also, we define R(£Il) — for 
L = 0, implying that no BS is selected and thus no data 
is transmitted in this fading state. For the details on how 
to achieve MIMO capacity under the given power budget 
and how to allocate power across multiple transmit antennas, 
please refer to [5]. 

After obtaining R(CIl), we have the best selection strategy 
to optimize the achievable rate as follows: 



m&x{R(n L )}. 



(13) 
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00. Let <P := {1, 2, . . . , K hs } and * := 0, and Z = where 
is the empty set and |*| denotes the cardinality of the set 

! Use variables * and \P to store all selected BS's and all other 
BS's, respectively. 

01. For i := 1 to L ! Add one BS to 'J 7 in each step. 

02. For z := 1 to Z ! Examine Z BS's in ty, respectively 

03. @ z :=$U {ipz}, where %l> z is the 2th element in 

! Pick a BS of * to form a new BS subset Z with <F. 

04. R z := R(0 Z ) based on Eq. (12) by setting Q L := & z . 
! Examine the achievable rate of Z . 

05. End 

06. z* : = argmaxK z < z Vk z J ; 

! Select the BS to maximize the achievable rate. 

07. ¥ := e z », * := *\{V> Z *}. and z ■= 1*1; 

! Add the newly selected BS into the BS subset *. 

08. End 

09. Q L := *. ! Complete the BS selection and get Cl L . 

Fig. 3. The pseudo codes to determine Ql by using the incremental BS- 
selection algorithm for the single-user case. 

To derive the optimal solution for this optimization problem, 
it is clear that we need to examine all ( K * B ) possible BS com- 
binations, which leads to the prohibitively high computational 
complexity as M gets large. Alternatively, we consider two 
suboptimal approaches with low complexities as follows. 

1) . Incremental BS-Selection Algorithm: In [7], the au- 
thors developed the fast antenna selection algorithm using 
the incremental-selection strategy. Although this incremental- 
selection strategy was developed for antenna selection without 
CSI feedback, it can be readily extended to the scenario for 
BS selection with CSI feedback to achieve the near optimal 
data rate. The pseudo codes of the incremental BS-selection 
algorithm are given in Fig. 3. In particular, the idea of this 
algorithm is to determine £1^ through L steps, where in each 
step one BS is selected, as shown in lines 01-08 of Fig. 3. 
In each step, one selected BS is added to the BS subset 
denoted by fy, where the selection criterion is to maximize 
the achievable rate of the updated BS subset 'J?. Then, after 
L steps, we have totally added L BS's into 'J? and then 
assign Hl ■= \f r . This algorithm only examines the achievable 
rates for L(K^ S — (L — l)/2) different BS combinations, 
which requires L(Kh s — (L — l)/2) times of singular value 
decomposition (SVD) to calculate the MIMO channel capacity, 
resulting in much less complexity than the optimal approach 
which examines all (^ 3 ) BS combinations. 

2) . Ordered-Gain Based BS-Selection Algorithm: The 
ordered-gain (or ordered-SNR) based BS-selection algorithm 
selects L BS's with the largest aggregate power gain over all 
BS's, where the aggregate power gain is defined by Eq. (1). 
Since maximizing the aggregate power gain may not effec- 
tively optimize the achievable transmission rate for MIMO 
links, the incremental BS-selection algorithm usually domi- 
nates the ordered-gain based BS-selection algorithm. However, 
since the ordered-gain based BS-selection algorithm does not 
need to perform the SVD, its complexity is much lower than 
that of the incremental BS-selection algorithm. 

B. Time Sharing and Probabilistic Transmissions 

To get the more general framework for BS selection, we 
apply the time sharing and probabilistic transmission strate- 



gies, respectively, over different transmission modes, which 
are described as follows. 

1) . Time Sharing Transmissions: Each time frame can be 
divided into (K^ s + 1) time slots with the lengths equal to 
{Ta m }^ , where a m is the normalized time-slot length and 
Z)m=o a m = 1- Within the mth time slot for to > 1, the 
transmission mode L BS will be used; for m = 0, no data is 
transmitted in the corresponding time slot. Then, the total ser- 
vice rate in a time frame is equal to J2l=o a LR(^L), where 
R(n L ) is given by Eq. (12) for L ^ and R(Q L ) = for L = 
0. Furthermore, the total BS usage is given by Y^l=qLo.l- 
The purpose of applying the time sharing based transmissions 
is to increase the system flexibility and to gain the continuous 
control on the BS usage within each time frame. Accordingly, 
we need to identify how to optimally adjust a with CSI and 
QoS constraints, where a. = (a , ai, . . . , Qm)- 

2) . Probabilistic Transmissions: Under this strategy, within 
each time frame only one transmission mode will be used for 
distributed MIMO transmissions. In particular, we will select 
transmission mode L with probability equal to 4>l and define 
4> = (^o, 4>2, • ■ • , 0if bs ) r - Then, our target is to determine 
how to dynamically adjust <f> according to the CSI and QoS 
requirements. 

If setting 4> = a and using the same strategy to determine 
£Il over all fading states, we obtain the same BS usage. 
However, the effective capacities (see Eq. (10)) achieved under 
the time-sharing transmission and the probabilistic transmis- 
sion, denoted by Cts(«, #i) and Cpr(</>, 6*i), respectively, are 
different. Specifically, we derive 

CrsWJ = -^-log(E H {e-^S&o^(^)}) 

= -^log(E H {0Le- e ^-«^)}) 
= C PR (<Mi), (14) 

where the inequality holds because Eh |e _ ^i=« Q i- R (°i) j is 

convex over (R(Qo), R(Qi), . . . , R(flK hs ))- Eq. (14) suggests 
that the time-sharing transmission generally outperforms the 
probabilistic transmission. However, the probabilistic trans- 
mission is more realistic to implement than the time-sharing 
transmission due to the following reasons. On the one hand, for 
the optimized time-sharing transmission, the time-slot length 
Ta l may be quite small and thus very hard to implement. On 
the other hand, the multiple time slots (for the time-sharing 
transmission) within a time frame introduces more overhead 
than the single-slot case (for the probabilistic transmission). 

C. Optimization Framework Using Time-Sharing Transmis- 
sions with Incremental BS Selection 

As discussed in Section II-C, our major objective is to 
minimize the average BS usage. In this section, we focus on 
the framework which employs the incremental BS-selection 
algorithm for each transmission mode and apply the time- 
sharing transmission for different transmission modes. Then, 
we develop the efficient BS-selection scheme under the above 
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framework by solving the following optimization problem 
Al, which aims at minimizing the average BS usage while 
guaranteeing the delay-QoS requirement. 



Al : min {Li = min 
a(H) 1 > a(H) 



«l(H)L 

,L=0 



s.t: 1). > a L (H) = 1, VH; 



L=0 



(15) 



2). E H {e- 0l S&o^(H)fl(n i )_ e -9iC 1 J< 0i (16) 

where Eq. (16) is the constraint to guarantee the delay-bound 
violation probability as derived in Eq. (2), we obtain Ql 
through the incremental selection algorithm listed in Fig. 2, 
which varies with H, and we determine R(Ql) based on 
Eq. (12). Note that we denote the time-sharing vector (orig- 
inally defined in Section IV-B) by a(H) in problem Al to 
emphasize that a is a function of H. When the context is 
clear, we will then drop the symbol (H) and only use a to 
simplify notations without causing confusions. We call the 
optimal solution to Al, denoted by a*, as the incremental 
BS-selection and time-sharing based (IBS-TS) scheme. 

Note that the optimization over a is an (K\, s + 1)- 
dimensional problem. To reduce the dimension of optimization 
variables, for each H we define 



R(C) = max|^a L i?(0 L )| , for each H (17) 

s.t. oil = 1; OLj^h = C, for each H. (18) 



L=0 



L=0 



Based on Eqs. (17)-(18), R{C) is the maximum achievable rate 
over all a. with the same BS usage. Since R{C) is the convex 
combination [12] over {R(Ql)}l=v tnen tne definition in 
Eqs. (17)-(18) suggests that R{C) is a piece-wise linear and 
concave function, which can be written as 



R(C): 



R (m,j-i) + Vj (C 



o. 



_i), if£e(mj_i,m 
.7 = 1,2, . . . ,/C; 
if C = 0; 



(19) 



for a certain integer /C, where too < mi < • • • < m/c, mo = 0, 
to k = K hs , and rrii G {0, 1, . . . , -Kbs}' Moreover, using 
(mi,R(m,i)) to represent the coordinates of a point in the 
two-dimensional plane, we can identify { (rrii, R(rrii)) } 
through the following procedures: a), find vertices of the con- 
vex hull spanned by two-dimensional points { (L, R(L)) } . = b J; 
b). { (m^ R(m,i)) are located above the line segment 

with end points (0,0) and [K\, s , i?(.Kb s )). Accordingly, z/j is 
the slope of the line segment starting at the point (rrii, J?(m,)) 
and ending at the point (m,_i, J?(mj_i)), which is deter- 
mined by 



R(mi) - #(m;_i) 



TO,; 



TOi_i 



i = l,2,. 



(20) 



For presentation convenience, we also define = oo and 
i^/c+i — — oo. Note that R(C) defined in Eq. (19) and the 
associated variables, including Q L , JC, {rrij}f =1 , and {vi}f =l , 
are all functions of H. Furthermore, given C € [rrij-i, nij], 
following the piece-wise linear property^we derive the corre- 
sponding a to achieve the service rate R{C) as follows: 



m 



if L 
if L = rrij ; 
otherwise. 



(21) 



Applying Eqs. (17), (18), and (21) into problem Al, we 
can equivalently convert to the following problem Al': 



Al' 



min {E H {£(H)} } 

£(H) 1 



s.t.: E 



-0ifl(£(H)) 



< 0, (22) 



where £(H) is a function of H and we denote the optimal 
solution to problem Al' by £*(H). To simplify notations, 
we will drop the symbol H for problem Al'. After obtaining 
the optimal solution £*, we can then uniquely map C* to a 
through Eq. (21). Since R{C) is an increasing and concave 
function, e~ eiR ^ is convex over C [14, pp. 84]. Thus, we 
can see that Al' satisfies: a) the objective function is convex; 
b) the inequality constraint function E H {e~ 0lii ( £ ) - e~ 9lCl } 
is convex. Therefore, Al' is a convex problem [14, pp. 137]. 
Then, using the Lagrangian method, we solve for the optimal 
solution of Al', as summarized in the following Theorem 1. 

Theorem 1: The optimal solution to Al' , if existing, is 
determined by 



m, 



'3i 



if Vj 



< 



e eiR( mj ) 



'j+i 

log(0iX*Uj) R(m,j- 



"A 



T-j-l, 



(23) 



if R(rrij-i) < 



log(fliAVj) 



< R(rrij), 



where R(-) and v j are characterized by Eqs. (17) through (20). 
In Eq. (23), A* > is a constant over all fading states, which 
needs to be selected such that equality in Eq. (16) holds. 

Proof: The proof of Theorem 1 is provided in the 
Appendix A. ■ 
Remarks: (i) Having obtained the optimal C* for problem 
Al', the optimal solution to Al, denoted by a*, is obtained 
by setting C = C* in Eq. (21). (ii) Under the optimal solution, 
we do not allocate time slots for all transmission modes. As 
indicated by Eq. (21), within any time frames, we employ at 
most two transmission modes, (iii) It is clear that by setting 
OiK bB = 1 for all time frames, we use the maximum transmit 
power and thus obtain the maximum achievable effective 
capacity, which is denoted by Cmax- If Cmax i s st iU smaller than 
C\, the specified delay-QoS requirement cannot be satisfied 
since we have used up all power budget. As a result, no 
feasible solution exists for this case. In contrast, if Cmax > C\, 
we can always find the optimal solution, (iv) The analytical 
expressions for A* is usually intractable. Alternatively, we use 
the following numerical-tracking method to derive the optimal 
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A*. The details of the numerical-tracking method and the 
discussions on its complexity are provided in Appendix B. 

D. Optimization Framework Using Probabilistic Transmis- 
sions with Ordered-Gain Based BS Selection 

We in this section consider the framework using the ordered- 
gain based BS-selection algorithm and the probabilistic trans- 
mission strategy. Specifically, we formulate the optimization 
problem for this framework as follows: 



A2 : min {h\ = mirJ E H < V fe(H)i 

0(H)* 0(H) \ 



s.t: 1). 



E< 

L=0 



>l(H) = 1, VH; 



(24) 



2). E H |^^( H ) e " eifl(ai) J- e " eiC j< ' (25) 

where the probabilistic transmission vector is denoted by a 
function 4>(H) of H. When the context is clear, we drop the 
symbol (H) for problem A2 to simplify notations. We call 
the optimal solution to the optimization problem A2 as the 
ordered-gain and probability transmission based (OGBS-PT) 
scheme. 

Theorem 2: The optimal solution to problem A2, denoted 
by </>*, is given by 



1, if L = L* 
0, if L ^ L* 



(26) 



for all H, where 

L* =argmm{ J L + A*e- eii ^ i )}. (27) 

In Eq. (27), A* > is a constant over all H, which is selected 
such that the equality holds for Eq. (25) to guarantee the delay- 
QoS requirement. 

Proof: The detailed proof of Theorem 2 is omitted due to 
lack of space, but is provided on-line in the technical-report 
version [19] of this paper. ■ 
Remarks: Theorem 2 suggests that under the optimal solu- 
tion, the probabilistic transmission reduces to a deterministic 
strategy, where the only transmission mode L* will be used for 
data transmission. Similar to problem Al, the parameter A* 
for A2 also needs to be tracked through numerical searching. 

V. QoS -Aware BS Selection for the Multi-User 
Case 

We next consider the distributed MIMO transmissions for 
the case with multiple mobile users. 1 For efficient BS selec- 
tion and distributed MIMO transmissions, the central server 
controls the selected distributed BS's and the mobile users 
to constitute the broadcast MIMO link, as mentioned in 
Section II. Specifically, given the transmission mode 2 L and 

'We use the terms of "mobile user" and "user" exchangeably in the rest of 
this paper. 

2 For the multi-user case, we also use the term of transmission mode L to 
denote the case where the BS-subset's cardinality is L. 



BS-index subset SIl = {il.i, «l,2, ■ • • , of the selected 

BS's, the channel matrix of the nth mobile user, modeled by 
Hf?\ is determined by 



Hs) t [Hn,! 1(1 H„j [j2 • • • H nj j L|I J , n 1,2,..., K rnu 

where h[™^ is an N n x ^X^=i -^z, matrix. Then, the 
physical-layer signal transmissions can be characterized by 



yh 



(n) _ H (n) \p (i) („) „ _ 1 o K 



i=l 



where Sq represents the ith user's input signal vector for the 

(n) (n) 

MIMO channel H^', is the signal vector received at the 
nth user's receive antennas, and $(") is the complex additive 
white Gaussian noise (AWGN) vector with unit power for each 
element of this vector. 

It is well-known that the optimal capacity of the broad- 
cast MIMO link can be achieved through the dirty-paper 
coding techniques [16], which is, however, hard to imple- 
ment due to its high complexity [17]. Alternatively, we 
apply the block-diagonalization precoding techniques [17] 
for distributed MIMO transmissions and then concentrate 
on developing efficient QoS-aware BS-selection schemes and 
associated resource allocation schemes, which are elaborated 
on in the following sections. 



A. The Block Diagonalization Technique for Distributed 
MIMO Transmissions 

The idea of block diagonalization (BD) [17] is to use 

(n) 

a precoding matrix, denoted by T y n for the nth user's 

L (i) 

transmitted signal vector, such that T 



= for all 

+ n. By setting = T^s^, where s|" l } is the nth 
user's data vector to be precoded by we can rewrite the 
received signal as 



An) 



(n) 



iiWrW") i An) 



where T 



(n) 



Hi 



_(n) 
» j 

,(») 

n L - 



(28) 



Under this strategy, the nth user's 
signal will not cause interferences to other users. Accordingly, 
the MIMO broadcast transmissions are virtually converted 
to K mu orthogonal MIMO channels with channel matrices 
{Tq^ } n= Ti • Then, we can get the nth user's maximum achiev- 
able rate in a fading state, denoted by (Ql, Vjrf*), as 
follows: 



,(«) 



= max BT log 



det I - 



1 Or " 




s.t.: Tr 



(E (n) ) = 



,(«) 



(29) 



where is the covariance matrix of 



and V 



(n) 



is the power allocated for the nth user. Also, we define 
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J?(") (n L , Vf 1 ) = for L = 0. Note that the aforementioned 
may not exist, implying that the total number of trans- 
mit antennas over the selected BS's cannot support enough 
freedom for block-diagonalization. In this case, no data will 
be transmitted to the nth user to avoid interferences caused 

(n) 

to other users. For the procedures on how to determine Iq^, 
please refer to [17]. 



B. Priority BS-Selection Strategy Given BS Subset Cardinality 

When the transmission mode is specified, i.e., the cardinality 
of the BS subset is given, every user expects to select the BS 
subset that maximizes its own transmission rate. However, it 
is clear that this objective cannot be obtained for all users in 
the multi-user case. Moreover, the derivation of global optimal 
selection strategy in terms of minimizing the average BS usage 
is intractable due to the too high complexity, where we need 
to examine all (^ s ) possible BS combinations. Therefore, we 
propose a simple yet efficient BS-selection algorithm, called 
priority BS-selection (PBS), which is detailed as follows. 

For the nth user, the global maximum achievable transmis- 
sion rate is attained when all BS's are used and all the other 
users do not transmit. Thus, the maximum achievable rate is 
given by 

R^(n KbB ,v Khe ) 

{BTlog[det(l + H„E< n >Ht)]}. (30) 



= max 

B(«):Tr(s("))='P Kb 

Then, we get the maximum achievable effective capacity of 
the nth user, denoted by Cmax, as follows: 

Ctl = -i-log(E H { e - 9 ^ ( " , K bs ^)}) (31) 

for n — 1,2, ... , K mu . We further define the effective-capacity 
fraction for the nth user as the ratio between the traffic loads 
and the maximum achievable effective capacity. Denoting the 
effective-capacity fraction by C n , we have C n = C n /Cmlx- 
Clearly, for a higher C n , the nth user needs more wireless 
resources to meet its QoS requirements. Thus, in order to 
satisfy the QoS requirements for all users, we assign higher 
BS-selection priority to the user with larger C n . Following 
this principle, we design the priority BS-selection algorithm 
to determine CIl in each fading state and provide the pseudo 
code in Fig. 4. For presentation convenience, we sort {C n }n=i 
in the decreasing order and denote the permuted version by 

{ c *U)}h=i> where ^(l) > Ctt(2) >■■■> Cir(Jf mu ) indicates 
the order from the higher priority to the lower priority. In the 
rest of this paper, we use the term of user n(i) to denote the 
user with the ith largest effective-capacity fraction. 

As shown in Fig. 4, in each fading state the BS-selection 
procedure starts with the selection for user 7r(l), who has the 
highest priority. After picking one BS for user 7r(l), we select 
one different BS for user 7r(2). More generally, after selecting 
for user we choose one BS for user n(j + 1) from the 
BS-subset vp, which consists of the BS's that have not been 
selected. This procedure repeats until L BS's are selected. For 
user-7r(j)'s selection, we choose the BS with the maximum 
aggregate power gain (see Eq. (1) for its definition) over the 



01. Let * := {1,2,_. .,K hs } and * := 0, and I = |*|; 

! Use variables \P and * to store selected BS's and all other BS's, 
respectively. 

02. j := 1. ! User -rr(j) is selecting BS. 

03. While (t < L) ! Iterative selections until L BS's are selected. 

04. m* = argmin mgs ,{7 lr(j)im }. 
ln(j),m ls tne aggregate power gain associated with user 



! Select BS to maximum the aggregate power gain for user tt(j). 

05. f :=f U {m* }, * := *\{m* }, and I := I + 1. 

! Update <F, >t, and I. 

06. If j = K mn , then j := 1; else j := j + 1. 

! Let next user with lower priority to select BS. 

07. End 

08. Q L :=¥. ! Complete the BS selection and getfl L . 

Fig. 4. The pseudo codes to determine fi^ in each fading state by using the 
priority BS-selection algorithm for the multi-user case. 



subset "J/. In addition, after user-7r(if mu )'s selection, if the 
number of selected BS's is still smaller than L, we continue 
selecting one more BS for user 7r(l), as shown in line 06 in 
Fig. 4, and repeat this iterative selection procedure until having 
selected L BS's. 

C. The Optimization Framework for BS-Selection and Re- 
source Allocation 

C.l. Problem Formulation for Average BS-Usage Minimization 
We next study how to determine which transmission modes 
will be used, and how to derive the corresponding resource 
allocation strategy by integrating the block diagonalization 
and the priority BS selection. Similar to the OGBS-PT 
scheme for the single-user case, we also apply the prob- 
abilistic transmission (PT) for the multi-user case, where 
transmission mode L is used with a probability denoted by 
<f>L, L = 1, 2, . . . , Kt, s . Note that for each transmission 
mode, there are K mu coexisting links towards K mu mobile 
users. Consequently, we also need to determine how to al- 
locate the total power Vl to these K mu coexisting links. 
In particular, we describe the power allocation strategy in 
a fading state by V = (Vi,V 2 , . . . ,V Khs ) with Vl = 
(7f \ Vf, V ( L Kbs) ) for L - 1,2,..., K hs , where V 
denotes the power-allocation policy for the entire system 
and the vector Vl represents the power-allocation policy 
for transmission mode L. Then, we formulate the following 
optimization problem A3 to derive the optimal QoS-aware 
probability- vector cf>* = (<j>* , ^ , . . . , 4>* Kian ) and its corre- 
sponding power-allocation policy V* . 



A3: min {L\= min 

(0(H),P(H)) 1 ' f>(H),7»(H)) 

s.t: 1). £> L (H) = 1, 

L=0 

-ffmu 



Eh |x>l(H)L 



VH; 



(32) 



2) - E 7 'i" ) ( H ) = ^' VL ' H ; w 

71=1 

3) . E H |xJ^(H) e - 9 ^ ( " ) (^^" , (H))| 



< e -e„c nj (34) 
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the block diagonalization) can be converted to parallel 

In) ^(™) 

Gaussian sub-channels, where Z) ' is the rank of T 0r , the 




(n) / (n) 

zth sub-channel's SNR is equal to e L z , and \/e T '_ is the 

~(n) 

zth largest nonzero singular value of Tq l . The optimal power 
allocated to the zth sub-channel follows the water-filling 
allocation, which is equal to p^ z = [u^™* 1 — l/ £ i™z] + ' where 



fj,^ > l/£^™{ and p}^ 1 is selected such that X)z=i pT 



.(«) 



,(«) 



(n) 



,(») 



i(n) 



Since has only Z^ non-zero singular values, 

for presentation convenience, we define — oo for 

i = Z^ +1. Accordingly, we can show that 



d7> 



ST 



(35) 



(n) 



holds and that J?(") (Sl L , Vf) is strictly concave over V 

Moreover, if ^ e [l/ e g, l/ e g +1 ) for i = 1, 2, . . . , ^, 
we get: 



(a).P^=#- 



Z^ (n) 



Cb).flW(fi L ,Pi n) )=BTIog ( negjj+BTilog 



(n) 

A ■ 



= 1 



(36) 



C.3. r/ze Optimal Solution to A3 

Theorem 3: The optimal power-allocation policy "P* for 
optimization problem A3, if existing, is given as follows: 



BT0 n 
' 1 + BTOn 



n 



C: 



H.L 



Z-^ _(n) ' 



(37) 



for all n, L, and H, where is the square of Tq^'s jth 
largest singular value, and i* is the unique solution satisfying 
the following condition: 



(«) ^ 



Vn, L, H, 



(38) 



where 

(n) 
Mi 



where 0(H) and 'P(H) characterize the probabilistic trans- 
mission vector and the power allocation policy described in 
the beginning of Section V.C.I, both of which vary with H, 
and J?(") (n L , (H)) is determined through Eq. (29). After 
clearly define 0(H) and 7 > (H), we drop the symbol (H) in 
problem A3 to simplify notations. We call the optimal solution 
to A3 as the PBS-BD-PT scheme. 

C.2. The Properties of (Sl L , V^ } ) 

Before solving A3, we need to study the properties of 
R^(n L ,r { l l) ). Let us consider the nth user with 
not equal to zero. Similar to the results summarized in 

(n) 



Section IV-A, the nth user's MIMO channel T Ul (after w j tn 



BTO n 
' 1 + BTOj 



\BTe n X* n 



(39) 



The corresponding optimal probability-transmission policy is 
determined by 



,* / 1, if 
^ = \ 0, ot 



L = L* 
otherwise 



(40) 



L* = arg min < L 

0<L<K bs 



.^e-^WnJlx^-H'iog^)! VH, (41) 



71=1 



where given {A* }%=i for Eqs. (37)-(39), the optimal C H l k 
selected to satisfy the equation 2^rT=i C^zT^)* = f° r all L 
and H; {A*}^™J* are constants, which are selected such that 
the equality of Eq. (34) holds. 

Proof: We construct A3's Lagrangian function, de- 
noted by Ja3(0,"P;A,C h )> as Jas(0, V; A, C H ) = 
Eh{Ja 3 (0,P;A,C h )} with 



£^ L e-°» fl(B) ( ni ^ B) ) 



JA3(<t>,V; A, C H ) - y^^n 

n=l L-t=0 

- e -e«c n + ^ Chl JpW.^ + (42) 

L=l \n=l / L=0 

under 2^l=i 0l = 1> where A„ > for n = 1,2,..., if mu 
are the Lagrangian multipliers associated with Eq. (34), 
which are constants over all fading states, and A = 
(Ai, A 2 , . . . , Aif mu ); {(h,l}l=i ^ tne Lagrangian multipliers 
associated with Eq. (33), which vary with H and L, and 
Ch,l = (Ch,i, Ch,2, • ■ ■ , (n,K bB )- 

Problem A3's Lagrangian dual function [12], [14], denoted 
by Jas{\ Ch)' i s determined by 



-L,i*+l, 



J A3 (A,C H ) = min \j A3 (<P,V; A, C H )| 

= E H |mm{j A 3(0,-P;A,C H )}}- (43) 

We denote the minimizer pair in Eq. (43) by (</>, "P). Then, 
we can derive 

r *«» / 

4> = arg min I S^^AL 

*:ElM^=1 U=l V 

+ gA„ e -^ (n, ( n -^ n) )^|, VH, (44) 

where Eq. (44) holds by plugging Eq. (42) into Eq. (43) and 
removing the terms independent of <p. Solving Eq. (44), we 
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obtain 

6 l = 1 , if L = are; min 
°i<e<K hs 



{^+f> e - 9 " i^( " , ( ^" , ) 

I n=l 



(45) 



otherwise, <\>l = 0. Following the above derivations, "Pl needs 
to minimize the function Ja3(</>, V; A, Ch) given </>l = 1 
and </>j = for all j ^ L. Then, we define a set of 
functions Jas.l{'P\ A, Ch l) f° r ^ = 1> 2, • • • , -Kbs, where 
Ja3,l(V; A, Ch,l) - ^3(0,^; A, C H ,L)Ut=i;0 J =o J #L ■ 
Taking the derivative of Ja3,l('P; A, Ch,l) wxt - ^l™^ an d 
letting the derivative equal to zero, we get 

Ch,l - BTX n e n ^e-^ RM (^ n) ) = (46) 

for all n, L, and H, where ^ = dR™ (Sl L , V^) / dV { [ L) 
as given in Eq. (35). Plugging Eq. (36)-(b) into Eq. (46) and 

(n) 

solving for the optimal [i L under the boundary condition of 

^ i/ e i"i' we obtain E q- ( 39 ) with i = Since £ q- ( 36 ) 

is obtained under the condition of (Jjf* G [l/ e L™i' l/ e L™i+i)' 

' in) 

the variable i* in Eq. (39) must satisfy the condition of pt L G 
[l/e^"]», l/e^» +1 ), as shown in Eq. (38). Moreover, we can 
show that Ja3,l(7- > ; A, Ch.l) is a strictly convex function, and 
thus i* for Eq. (39) is unique. 

The Lagrangian duality principle [12] shows that 
^(A, Ch) = ^(</>j "P; A, Ch) is concave over A and 
C H - Moreover, the original problem (also called the 
primal problem) A3's dual problem is defined by 
max( A ; £ H ) { J(X, Ch)}- We denote the optimal objective 
of A3 by L . The equation L > max^f ) {^(A, Ch)} 
always holds [12], where the difference between L and 
max (A,c H ) { J{\ Ch)} is known as the duality gap [12]. We 
can further show that J(X, Ch) i s differentiable w.r.t. A and 
Ch- Then, based on the Lagrangian duality principle [12], we 
can obtain 

-e„ii (n) (ni,,-p[ n) ) 



dJ(X 



9Ch 



-e- 6 ^}, Vn; 
^ = (En=i u ^ - ^) .9(H)rfH, VL, H 



where 5(H) is the probability density function of H and rfH 
denotes the integration variable. As a result, the maximizer 
Ch, l must be selected such that £n=i ^ -V L = 0. Such 
a Ch l exists because Ch,l — > and Ch.l — * °o leads to 
■P L ™^ — > 00 and — ► 0, respectively, as indicated by 
Eqs. (36)-(a) and (39). 

Having obtained Ch> we next focus on the optimal A* to 
maximize J(X, Ch)- Due to the concavity of Jas{\ Ch)> 
dJ(X, Gji)/dX n is a decreasing function of A„. Also, we can 
readily show that dJ(A, Ch1/^"U„=o > 0. Then, if there 
does not exist A such that dj(\, Ch)/^« = for all n, we 
have A* — > 00 for some nth user and dJ{X, Ch)/^^« > 
always holds. For this case, we get L > J(X*,Cii) ~ * °°> 
implying that there is no feasible solution for A3. 



In contrast, if there exists A* such that dJ(X*\ Ch)/^« = 
for all n, the pair of (A*, Ch) is me optimal solution to the 
dual problem given by Eq. (47). Plugging dJ(X*, Ch)/^« = 
into Eq. (47), we can see that the effective-capacity con- 
straint required by Eq. (34) is satisfied for every user. Also note 
that the optimum for max(x,£ H ) {J(X, Ch)} is achieved by 
using ("P, <fr) under A* and Ch> which satisfy the constraints 
imposed by Eqs. (32)-(33). Therefore, all constraints for 
problem A3 are satisfied, implying that this policy is feasible 
to problem A3. As a result, L = max(x,c H ) {j(X, Ch)} 
holds with zero duality gap. Thus, this policy is the optimal 
solution to A3. Then, setting V* = V and <fi* = <fi with A* 
and Ch i n Eq. (45), we obtain Eqs. (40)-(41). Further plugging 
Eq. (39) into Eq. (36)-(a), we prove that Eq. (37) holds. 
Finally, comparing dj{\*, Cii)/dX n = w i tn Eq. (47), we 
show that the equality of Eq. (34) holds, which completes the 
proof of Theorem 3. ■ 

Note that there are no closed-form solutions for the opti- 
mal Lagrangian multipliers Ch.l an d A*. However, we can 
determine the values of Ch l an d A* by using the numerical 
searching method similar to the approach given in Appendix B. 
The detailed searching techniques are omitted due to lack 
of space, but are provided on-line in the technical-report 
version [19] of this paper. 

D. The TDMA Based BS-Selection Scheme 

We next propose the TDMA based QOS-aware BS-selection 
scheme for the comparative analyses. In the TDMA based BS- 
selection, we also apply the priority BS-selection algorithm 
given by Fig. 4 when transmission mode L is specified. For 
transmission mode L, we further divide each time frame into 
K mu time slots for data transmissions to K mu users, respec- 
tively. The nth user's time-slot length is set equal to T x t^, n 
for n = 1, 2, . . . , K mu , where t^„ is the normalized time-slot 
length. Moreover, we still use the probabilistic transmission 
strategy across different transmission modes, where the proba- 
bility of using transmission mode L to transmit data is equal to 
4>l- Then, we derive the TDMA and probabilistic transmission 
policies through solving the following optimization problem 
A4. 



(47) A4 : 



min {L}= min (e h (v^l(H)1 

(t(H),0(H)) 1 ' (t(H),0(H))| Yf^ Q J 



s.t.: 1). ^^(H) = l, VH, 



(48) 



L=0 



2). ^t iin (H) = l, VH, L = l,2,...,K bs , (49) 



71=1 



3). E H I J2M^)e- entL - n{Ii)RM{QL ' VL) 



,L=0 



< e 



-o„c„ 



Vn, (50) 



where 0(H) and t(H) are the probabilistic transmission 
vector and the time-division policy. In particular, in prob- 
lem A4 we have 0(H) 4 («fo(H), <£i(H), ■ • ■ , <fo m „(H)) 
and t(H) 4 (ti (H) , ti (H) , . . . , tj£- bs (H) ) with t L == 
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Fig. 5. The deployment 
(a) Single-user case: K^ s = 

(-7.83, 13.33), (25.50, -22.49), (17.98, 25.00), and (-26.34, 11.62); the 
mobile station's coordinates are (4, —11). (b) Multi-user case: K bs = 6, 
whose coordinates are (35.77,22.69), (13.06,-37.45), (27.15,-26.33), 
(-40.28,-0.14), (-32.86,-28.65), and (-5.10,29.98); ftT mu = 3, 
whose coordinates are (—11, 0), (3, 5), and (2, —12). 



(W(H),t Ll2 (H),... > t LiJrb .(H)). We use 0(H) and t(H) 
to denote the probabilistic transmission vector and the time- 
division policy to address that they are function of H. When 
the context is clear, we drop the symbol (H) in problem A4 
to simplify notations. We call the optimal solution of problem 
A4 as the PBS-TDMA-PT scheme. 

Theorem 4: Problem ^44's optimal solution pair (t*, </>*), 
if existing, is determined by 



t* — 

l L,n — 



1 



(n L ,p L ) 



log 



s k,L 



(51) 



for all L, n, and H, and 

1, if L = arg 



mm 

=0,1,.. ..if 



J' 



0, otherwise 



-««ti,„fl (n) (^.^)|j ( 52 ) 



for all L and H, where <5j^ L under given {A* }^=i is 

determined by satisfying J2n=i *i n = ^ m ^ {Ki}n=i nee ds 
to be selected such that the equality of Eq. (50) holds. 

Proof: The detailed proof of Theorem 4 is omitted due to 
lack of space, but is provided on-line in the technical-report 
version [19] of this paper. ■ 

VI. Simulation Evaluations 

We use simulations to evaluate the performances of our pro- 
posed QoS-aware BS-selection schemes for distributed MIMO 
links. In particular, for the single-user case, the BS's deploy- 
ment and the mobile user's position are shown in Fig. 5(a), 
where K\, s = 5. For the multi-user case, the BS's deployment 
and the mobile users' positions are given by Fig. 5(b), where 
-Kbs = 6 and K mu = 3. Moreover, we set T = 10 ms and 
B = 10 5 Hz throughout the simulations. In the simulations, 
we focus on the case that the positions of users do not change. 



This can be mapped to the scenarios where users move slowly 
(e.g., pedestrians) or do not move. In such scenarios, the 
users' statistical channel conditions, depending on the dis- 
tances between users and distributed BS's, often change very 
slightly in tens of seconds, during which many (thousands of) 
packets will be transmitted before the users' statistical channel 
conditions vary significantly. Then, our developed statistical 
delay-QoS aware BS-selection schemes, which are based on 
the current statistical channel information, can effectively 
guarantee the corresponding delay QoS during this period. 
For simulations of scenarios that the users move very quickly, 
the delay-QoS-aware BS-selection schemes highly depend on 
the users' moving speeds and trajectories. These requirements 
are different from the requirements on typical simulation 
environments [23] for transmission error rate evaluations in 
cellular networks, where only the distribution information of 
users' positions are needed. As a result, the delay-QoS-aware 
BS-selection schemes to be re-designed, which will be studied 
in our future work. 

In the simulations, we employ the following average power 
degradation propagation model [15]. For the given reference 
distance d le {> if the transmission distance, denoted by d, is 
smaller than or equal to d le {> the free-space propagation model 
is used; if d > d Ic i, the power degradation is proportional to 
(d/d IB {) v , where r\ is the path loss exponent and typically 
varies from 2 to 6 indoor environments without LOS [15]. 
Accordingly, the variance h n , m of -ff„, m 's elements can be 
determined by 



if dn,m 6 (0, d r ef] 



G 



(fe)' 



if gL 



6 (d ref , oo), 



where d n , m is the distance between the mobile user and the 
mth BS, and G is aggregate power gain generated by the an- 
tenna and other factors. In simulations, we set d re f = lm [15] 
and -q = 3. Furthermore, we select G such that h n , m = dB 
at d ntm = 50 m. Also, we set of h = dB for the evaluation 
of the average interfering range (see Section II-C for its 
definition). 

We also consider the following baseline BS-selection 
schemes for comparative analyses: 

1). Base-Station Selection with Fixed Cardinality for Single- 
User Cases 

This scheme does not change the selected BS-subsect's 
cardinality over all fading states. The fixed cardinality, denoted 
by Lfi x , is determined by 



-Lfi x = min < L 



E 



L is fixed 



(53) 



where £Il for each H is determined through the ordered-gain 
based BS-selection algorithm given by Section IV-A, as also 
used by our derived OGBS-PT scheme. 

2). Optimal Time-Sharing Based Scheme for Single-User 
Cases 

The optimal time-sharing (TS) based scheme is the same 
as our proposed IBS-TS scheme except that it uses exhaustive 
search to find the optimal Q,l for Eq. (13) in each fading 
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Fig. 6. Simulations of the single-user case for the specified delay-QoS requirements given by ' = 50 ms and £i = 10 , where K ba = 5, each BS has 
two transmit antennas, the mobile user has two receive antennas, V rf! { = 4, and k = 2.4. (a) Average BS usage versus traffic load, (b) Average interfering 
range versus traffic load. 




Required delay bound (ms) 
(a) (b) 

Fig. 7. Simulation results for single-user case varying the specified delay-QoS requirements and system parameters, where K^ B = 5, each BS has two 
transmit antennas, the mobile user has two receive antennas, (a) Average BS usage versus k, where D$ = 50 ms, £1 = 10~ 4 , Ci = 1.1 Mbps, and 
■Pref = 1. (b) Average BS usage versus delay-bound QoS requirements, where £i = 10 — 6 , C\ = 600 Kbps, k = 0.4, and Vref = 1. 



state. It is clear that the optimal TS based scheme provides 
the lower-bound of the average BS usage for our proposed 
IBS-TS scheme. 

3). Semi-Random BS-Selection Scheme for Multi-user Cases 
In this scheme, given the cardinality L, the central controller 
randomly selects SIl with the uniform distribution across 
all BS's. Furthermore, the cardinality L of the selected BS 
subset is determined by using the algorithm derived through 
Theorem 3. We call this scheme as the semi-random BD-PT 
scheme in that the selection of L is not in a random manner. 
We evaluate the semi-random BD-PT scheme to provide an 
upper-bound of the average BS-usage for our derived priority 
BS-selection algorithm given in Fig. 4. This semi-random 
strategy can be also applied to the TDMA based scheme, 
where Hl is randomly selected and L is determined by using 
the algorithm obtained through Theorem 4. 

Figures 6(a) and 6(b) plot the average BS usage and the 
average interfering range, respectively, versus the incoming 
traffic loads for the single-user case. As shown in Fig. 6(a), 
our proposed IBS-TS and OGBS-PT schemes both effectively 



decrease the average BS usage and the interfering range as 
compared to the fixed BS-selection scheme. This is expected 
because our proposed BS-selections can adaptively adjust BS 
selection in each fading state based on the CSI, traffic loads, 
and the QoS requirements. In contrast, the resulted average 
BS usage and the interfering range by applying the fixed 
BS-selection scheme cannot smoothly vary with traffic loads, 
which may cause unnecessary BS usage with more power 
consumption and thus larger interferences to the entire wireless 
networks. We also observe from Fig. 6 that the IBS-TS scheme 
needs less BS usage than the OGBS-PT scheme to support the 
incoming traffic loads under the specified QoS requirements 
and therefore generates lower interferences accordingly, which 
verifies our discussions in Sections IV-A and IV-B. However, 
we can see that the performance differences between the IBS- 
TS and OGBS-PT schemes are slight, especially when the 
incoming traffic loads are relatively high. In addition, we can 
observe from Fig. 6 that the BS-usage performance and the 
interfering range of our proposed IBS-TS scheme are almost 
the same as compared to the optimal TS scheme, which verifies 
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the effectiveness of our proposed IBS-TS scheme. 

Figure 7(a) depicts the dynamics of the average BS usage 
and the interfering range as functions of k, where k is the 
power increasing rate against the number of selected BS as 
defined in Eq. (3). When k gets larger, the power budget 
used for distributed MIMO transmission in each fading state 
increases. As a result, the average BS usage is reduced 
correspondingly for all schemes, as illustrated in Fig. 7(a). 
Again, Fig. 7(a) demonstrates the inflexibility of the fixed 
BS-selection, and the significant BS-usage reduction and the 
interfering -range reduction as a result of using our proposed 
IBS-TS and OGBS-PT schemes. 

Figure 7(b) illustrates the impact of delay-QoS requirements 
on the average BS-usage among different BS-selection al- 
gorithms. As shown in Figures 7(b), the average BS-usage 
decreases as the required delay-bound increases, which is 
expected because of the following reasons. When the delay 
QoS constraint becomes looser, suggesting a larger delay 
bound, the central controller does not have to use more BS's 
to improve the transmission rate when the wireless channel 



experiences deep fading. Correspondingly, the efficiency of 
using the distributed BS's can be improved, thus reducing 
the average BS usage. The above observations verify that our 
proposed BS-selection algorithms can effectively adapt to the 
variation of delay QoS requirements. Also, from Figure 7(b) 
we can observe again that the IBS-TS results in less BS usage 
than the OGBS-PT scheme, and the IBS-TS scheme achieves 
almost the same performance as the optimal TS scheme. 

Figure 8 compares the average BS usage and interfering 
range among our proposed BS-selection scheme for multi- 
user case under various system parameters. Fig. 8 shows 
that as the traffic loads increase, all scheme's average BS 
usages and interfering ranges increase to satisfy the specified 
QoS requirements for the incoming traffic loads. Moreover, 
the PBS-TDMA-PT scheme's BS usage increases much more 
rapidly than the PBS-BD-PT scheme. As shown in Fig. 8(a), 
the PBS-BD-PT scheme results in less average BS usage than 
the PBS-TDMA-PT scheme when the traffic load is high. 
The superiority of the PBS-BD-PT over the PBS-TDMA- 
PT scheme gradually vanishes as the traffic loads become 
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low. Furthermore, we can observe that given M m = 6, the 
PBS-BD-PT scheme achieves lower BS usage than the PBS- 
TDMA-PT scheme under various traffic loads. This is expected 
since it has been demonstrated that the BD technique usually 
outperforms the TDMA technique in achieving high through- 
put. However, for M m = 4, the PBS-TDMA-PT scheme leads 
to less BS usage when the traffic load is relatively low. The 
above result is caused by the following reasons. In order to 
support simultaneous transmissions across multiple users by 
using the BD technique, a necessary condition is that the 
total number of transmit antennas over all distributed BS's 
can provide enough freedoms for spatial multiplexing. Thus, 
if the number M m of transmit antenna per BS is smaller, 
more BS's have to involve the transmission to gain enough 
spatial-multiplexing freedoms, even when the traffic load is 
low. In contrast, when the traffic load is very low, TDMA does 
not have such requirement on spatial multiplexing because 
it achieves cross-interference-free transmissions through time- 
slot division. The above observations will motive us to develop 
hybrid TDMA-BD BS-selection schemes in future studies. 

Figure 8(a) further illustrates that our proposed PBS-BD-PT 
and PBS-TDMA-PT schemes in general yields less BS usages 
than the semi-random BD-PT and the semi-random TDMA- 
PT schemes, respectively. Note that when M m — 4, the BS- 
usages of the semi-random BD-PT scheme and our proposed 
PBS-BD-PT scheme are the same, which also increase very 
slowly as the traffic loads increase. This is because for small 
number of transmit antennas, more BS's are needed to support 
the space-multiplexing, as aforementioned. When more BS's 
are used, the achievable rate for each dimension/user often 
exceed the traffic load for the corresponding user when the 
traffic load is low. In such a case, even randomly selecting BS 
can satisfy the specified delay-QoS constraint under the given 
traffic loads. However, even when the BS usages of the semi- 
random BD-PT scheme and our proposed PBS-BD-PT scheme 
are the same, our proposed we can see from that Fig. 8(b) that 
the PBS-BD-PT scheme results in a smaller interfering range 
by selecting the BS based CSI. 

Figure 9(a) plots the dynamics of the BS usage versus the 
required delay bound. As depicted in Fig. 9(a), the larger 
delay bound leads to less BS usage for all schemes, which 
demonstrates that our proposed schemes can effectively adjust 
the BS selections to meet diverse QoS requirements across 
multiple co-existing users. Also, we can see that the semi- 
random based schemes cause significantly higher BS-usages 
than our derived PBS based schemes. Fig. 9(b) illustrates 
our proposed PBS-BD-PT scheme and PBS-TDMA-PT based 
scheme's performances versus n (see Eq. (3)) under diverse 
QoS requirements. Similar to the single-user case, the average 
BS usage deceases as k gets larger. Moreover, as shown 
in Fig. 9, the lower delay bound and the smaller violation 
probability threshold, implying more stringent delay-QoS re- 
quirements, cause more BS usages and thus larger interfering 
ranges. 

VII. Conclusions 

We proposed the QoS-aware BS-selection schemes for the 
distributed wireless MIMO downlink to minimize the BS 



usages and to reduce the interfering range affected by the 
distributed MIMO system, while satisfying diverse statistical 
delay-QoS constraints characterized by the delay-bound vi- 
olation probability and the effective capacity technique. For 
the single-user scenario, we developed the scheme using the 
incremental BS selection and time-sharing strategy and also 
developed the scheme employing the ordered-gain based BS- 
selection and probabilistic transmission strategy. The former 
scheme archives better performance, while the latter scheme 
is easier to implement. For the multi-user scenario, we de- 
veloped the joint priority BS-selection, block-diagonalization 
precoding, and probabilistic transmission schemes. We also 
studied the TDMA based BS-selection scheme for multi-user 
link. The extensive simulation results show that our proposed 
schemes can effectively support the incoming traffic loads for 
the specified QoS requirements and significantly outperform 
baseline schemes in terms of minimizing the average BS usage 
and the interfering range. 

Appendix A 
Proof of Theorem 1 

Proof: We construct the Lagrangian function of Al, 
denoted by Jai(£, A), given as Jai{£, A) = E H { Jai(A A)} 
with 

J A1 (C, A) = C + A ( e - - e- 6 ^) , (54) 

where A is the Lagrangian multiplier associated with the 
constraint of Al' . Then, the optimal C* and the optimal 
Lagrangian multiplier A* are solutions to the following equa- 
tions [12]: 

e B c Jai{C,X), VH; 
= E H {e- 01 «( £ )-e-^}. (55) 

where 8cJai(^, A) denotes the subdifferential [12] of the 
function Jai(A A) with respect to C. Note that the subd- 
ifferential is defined for nondifferentiable convex functions 
(e.g., piece-wise linear functions), which is the counterpart 
concept for the gradient of differentiable convex functions. 
Based on [12], for a convex function /(b) defined on b G 
Bel, where M is real-number set and B is a convex set, 
an n x 1 real-valued vector za is a subgradient of h(h) if 
/i(b') > h(h) + a7 T (b' - b) for all b' e B. Accordingly, the 
collection of subgradients at b is the subdifferential of h(h). 
For more details and properties of subdifferential, please refer 
to [12]. 

Applying the piece-wise linear property and the concavity 
of R{C), we derive 

d c JAi(C,X) 

1 - e x \vje- ^ m =\ 1 - 9i\v j+1 e- e ^ m ^, 

if C = m-j, j = 0, 1, . . . , K\ 
= < , ~ , (56) 

|l-0 1 A^e- ( W)}, 

if £e(TOj_i,TOj), j = 1, . . . ,/C. 

Plugging Eq. (56) into Eq. (55) and solving for the optimal 
solution, we get Eq. (23). Also, the equality of Eq. (16) needs 
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to hold as required by Eq. (55). which completes the proof of 
Theorem 1. ■ 



Appendix B 

The Numerical-Tracking Method to Derive A* and 
Discussions on Its Complexity 

As defined in Appendix A, the Lagrangian function under 
the optimal solution derived in Theorem 1 is given by 

Jai(£*, A) = E H {£*} + AE H [e- ^ {c ^ - e^ 1 } . 

Note that in this equation we remove the superscript (•)* 
of A* because we have not obtained the actual value of A*. 
Based on convex optimization theory, J7ai(£*,A), known as 
the Lagrangian dual function [12], [14], is concave over A and 
the derivative dJ A1 (C* ,X)/dX = E H { e -^ R< - c ^ - e - 6 ^}. 
Moreover, if the optimal solution exists, J7ai(£*,A) reaches 
its maximum at A = A* with EHle - " 1 ^*' - e^" 10 ' 1 } = 0. 
Therefore, we can use the gradient descent method [14] to 
search for A* numerically. The gradient descent method is 
implemented through the following iterative equation: 



A := A + eE : 



-eiR{c*) 



9lCl }, 



where e is a positive real number much smaller than 1 . For the 
fast tracking of A*, we can use the time average via the first- 
order autoregressive low-pass filter to estimate and replace the 
expectation EH{e" 9lIi ' £ ) — e~ 9lCl } for the above gradient 
descent method. In particular, we generate a large number 
(more than 10 4 ) of channel realizations, representing different 
fading states, which are indexed by time sequence k = 
1,2, . . .. Denoting the estimate of EHle" 8 ^^*' - e^ dlCl } 
at the fc-th time by J{[k], we use the following first-order 
autoregressive low-pass filter to update J{[k): 

J'Ak] 



*7a[*-1] 
+(1-0) (e 



-eifl(£*[fe]) 



H=H[fe] 



where d e (0,1) is a real number close to 1, C*[k] is 
determined by using Eq. (23) under the current A of the fcth 
time frame, and H[fc] denotes the CSI at the fcth time frame. 
With appropriately selected e and the above algorithm can 
converge and oscillate around the optimal A* within a small 
range. The above tracking method has been used and validated 
in many optimization problems for resource allocation over 
fading channels. By using this algorithm, we can conveniently 
find the optimal solution without deriving the closed-form 
expressions for the expectation and A*, which are usually hard 
to obtain for general fading channel models. 

Under the above strategy, the computational complexity 
for our proposed BS-selection scheme in each fading state 
is dominated by: 1) the incremental BS-selection algorithm 
given in Fig. 2 to determine fix,; 2) singular value decompo- 
sition (SVD) of the channel matrix to calculate the MIMO 
capacity [5]; 3) the procedures to find the convex hull of 
{(L,R(L))}f^ for deriving Eq. (19). It is not difficult 
to see that the complexity of the incremental algorithm is 
upper-bounded by 0(K^ S ); the complexity for SVD is known 



upper-bounded by 0(K^), where K-£ = JW m ; the 

complexity to identify the convex hull is upper-bounded by 
0(.Kbs log(-Kbs))- Therefore, the total complexity for each 
fading state is upper-bounded by 0(K^), implying the major 
computational complexity is caused by the calculation for the 
MIMO capacity. 
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